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We constructed a single-chain Fv antibody library that permits human complementarity-determining 
region (CDR) gene fragments of any germline to be incorporated ^^^^^^^^inatorially into the^ 
positions of the variable-region frameworks VH.DP47 and VL-DPL3. A library of 2 x ndepe" J^^^ 
transformants was screened against haptens, peptides, carbohydrates, and proteins, and the selected 
antibody fragments exhibited dissociation constants in the subnanomolar range. The antibody genes in 
this library were built on a single master framework into which diverse CDRs were allowed to recomb^ne 
These CDRs were sampled from in vivo-processed gene sequences, thus potentially optimizing the levels 
of correctly folded and functional molecules, and resulting in a molecule exhibiting a lower computed 
immunogenicity compared to naive immunoglobulins. Using the modularized assembly process to incor- 
porate foreign sequences into an immunoglobulin scaffold, it is possible to vary as many as six CDRs at 
the same time, creating genetic and functional variation in antibody molecules. 
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Highly diverse molecular libraries have recently become a vital tool 
in the search for molecules with novel properties. The antibody mol- 
ecule in particular has been the focus of extensive development of 
different combinatorial strategies in order to obtain antibodies with 
novel specificities. These strategies include the reassembly of natu- 
rally occurring genes encoding the heavy- and Ught-chain domains 
from either immune^-^ or nonimmune B-cell sources\ or introduc- 
tion of synthetic variability^^*^ into the individual complementarity- 
determining regions (CDRs), i.e, the hypervariable parts of the vari- 
able domains that interact with an antigen. 

Previously we demonstrated an approach to introduce genetic 
diversity in antibody libraries, where diverse natural CDRs are com- 
bined into an antibody framework^^ using only CDR-encoding gene 
fragments originating from sequences formed and "proofread" in 
vivo. Such proofread peptide sequences that have evolved and devel- 
oped in vivo in a specific biological context may have superior func- 
tional qualities compared with those produced in vitro. A high 
degree of functional variation is achieved by means of simultaneous 
and random combination of sbc biologically derived CDRs, which 
can extend the genetic diversity beyond what is naturally created in 
the immune system. In addition, the single master framework into 
which diese CDRs are introduced was selected to be compatible with 
the bacterial expression system employed. Finally, the modular 
design of the approach allows for iterative CDR shuffling to improve 
specificities and affinities, and the contribution of individual CDRs 
can be analyzed by systematic regional changes. 

In order to demonstrate the functionality of this concept, we 
have constructed a large human antibody single-chain fragment 
(scFv) library using these principles. This Hbrary has been success- 
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fully used for the isolation of antibody fragments specific for a num- 
ber of different ligands, including peptides, proteins (of human and 
nonhuman origin), carbohydrates, and haptens. These antibodies 
exhibit dissociation constants in the subnanomolar range. The data 
clearly demonstrate that a single-antibody framework can harness 
diverse CDR sequences previously not found in combination, 
including CDR loops with foreign canonical structures, and still 
retain the functionality of the antibody structure. 

Results and discussion 

Library design using proofread gene segments. We have construct- 
ed an scFv antibody fragment gene hbrary of 2 x 10' members, des- 
ignated n-CoDeR. The genes were built using the VH DP47 and the 
VL DPL3 immunoglobulin genes as single master frameworks^^ 
since they are (1) well expressed/displayed in bacteriaUphage 
systems^ and (2) present in vivo in most human individuals^^^^ 
Sequences encoding in vivo-formed CDRs from rearranged 
immunoglobulin genes of different germline origin were combined 
into the master framework. This was achieved by amplification of 
CDRs with primers specific for the DP47 and DPL3 frameworks^ ^ 
and overlap extension of single-stranded DNA (ssDNA) to create 
antibody genes encoding variable hght and heavy chains^-^^-^l xhe 
principle is outlined (Fig. 1). 

An extremely large genetic variation can be created by random 
combination of six in vivo-formed CDR sequences, although an 
exact calculation of the variation cannot be performed, since human 
germUne genes acquire additional genetic variation in vivo from 
somatic mutation processes and V(D)J recombination. The assem- 
bled 5cFv were cloned into the pFAB5c-His display vector^^ and 
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Figure 1. Schematic diagram for the construction of n-CoDeR, using 
in vivo-proofread gene segments, cDNA is produced (step 1) from 
IgM -G and -A producing cells, and this cDNA {step 2) is used as 
template for the specific amplification of each CDR individually {step 
3). These amplified CDRs are mixed with oligonucleotides encoding 
framework regions (step 4), and intact genes encoding variable light 
and heavy domains are synthesized, using overlap extension PGR 
(step 5). The synthesized variable light and heavy domains are joined 
together to form a complete gene encoding scFv antibody fragment 
with de novo gene sequences (step 6). 

transformed into Escherichia coli by electroporation. The resulting 
library contained 75-80% clones with an open reading frame 
(ORF), as determined by western blot analysis and by sequencing. 
Restriction enzyme digestion of 32 scFv genes using BamHl and 
5srNI and sequencing of 52 clones demonstrated that the library 
exhibited a high degree of variabihty (data not shovm). 

In the present investigation we demonstrate the functionaUty of 
the molecular design approach of varying all six CDRs and selecting 
a variety of specific antibody fragments with novel gene sequences, 
constructed using gene segments from antibodies of different 
germline origins. 

Genetic diversity using in vivo-formed gene segments. The use of 
in vivo-formed and proofread gene segments has several advantages 
compared to in vitro-designed and constructed synthetic oUgonu- 
cleotides employed in the creation of diverse gene libraries (e.g. refs 4> 
5, 18, 19). Proofread segments are optimized with regard to function- 
ality and will most likely not encode antigenic T-cell epitopes, since 
they have been adapted and processed by the immune system. 

Analysis of 10 selected scFvs from the n-CoDeR library, specific for 
different antigens, did not show an increased presence of T-cell 
epitopes (see Experimental Protocol), as compared to variable 
domains from 10 normal human antibodies of IgG type, specific for 
nonhuman proteins (summarized in ref 14). In contrast, the analysis 
indicated that scFv selected from n-CoDeR contained fewer T-cell 
epitopes than the natural antibody variable regions (Table 1 ). A possi- 
ble explanation for this is that the natural antibodies contained 



Table 1 . Analysis of potential T-cell epitopes in scFv-fragments 
sequences from n-CoDeR using peptide threading* 

A. Results for heavy chain 



n-CoDeR 


No. of epitopes 


Mature VH 


No. of epitopes 


Y8 


3 


GF4H 


4 


STR5 


4 


KL4H 


7 


X4 


2 


X753 


7 


M16 


3 


S55290H 


1 


LB 


7 


S55287H 


6 


LI 


0 


M979 


5 


F5 


4 


L26906H 


6 


CT173 


4 


L268 


5 


SM159 


4 


L08090H 


4 


HSP17 


4 


L08083H 


9 


Average 


3.5 


Average 


5.4 


B. Results for light chain 






n-CoDeR 


No. of epitopes 


Mature VL 


No. of epitopes 


Y8 


4 


D84136L 


5 


STR5 


1 


M97922L 


9 


X4 


3 


D84138L 


2 


M16 


3 


S55292L 


2 


L8 


4 


75387L 


2 


LI 


4 


K14L 


1 


F5 


2 


L26899L 


1 


CT173 


3 


GF4L 


5 


SM159 


2 


L26896L 


3 


HSP17 


4 


S55289L 


6 


Average 


3 


Average 


3.6 



'Peptide threading analyzes the potential binding of every possible 13-nner 
from the proteins of interest to hunnan HLA-DR allotypes. The linker was 
removed before analyses^^. 

somatic mutations in their framework regions, whereas, by design, the 
framework regions of the n-CoDeR-derived antibody fragments were 
all germline. Importantly, no T-cell epitopes were generated as a func- 
tion of mixing CDRs from other germlines with the master frame- 
work, whereas inclusion of the standard (04-8)3 linker between VH 
and VL often resulted in the generation of T-cell epitopes (data not 
shown). It is thus possible that specific binders derived from n-CoDeR 
and converted into whole antibodies would exhibit even lower 
immunogenicity compared to antibodies from other phage libraries. 
In contrast to biologically derived segments, the design of synthetic 
oligonucleotides is based on our current incomplete knowledge of the 
variation in the immunoglobulin repertoire, which may result in 
nonoptimal in vitro variation, including the possibility of encoding 
antigenic epitopes. Synthetic oligonucleotides are also associated with 
incomplete synthesis, resulting in deletions and/or mutations, which 
will reduce the frequency of functional clones. To overcome this prob- 
lem, the oligonucleotides used to create the n-CoDeR were produced 
by polymerase chain reaction (PGR), rather than chemically synthe- 
sized, which increases the frequency of functional genes (Kobayashi 
and Soderhnd, unpubUshed data)^^. The concept of DNA recombina- 
tion using DNA shuffling technology has been described previously^°. 
This technique was employed for the random combination of in vivo- 
formed gene sequences^^-^. However, the size and location of DNA to 
be recombined was undefmed. In contrast, the n-CoDeR technology 
allows sequences of defined size and location to be combined. 
However, both these examples point to the advantage of using highly 
functional, in vivo-formed DNA sequence in gene library constructions. 

Selection of functional antibodies from n-CoDeR. The designed 
library was used for the selection of several specific antibody frag- 
ments against haptens, peptides, proteins, and carbohydrates (Table 
2). Selections were carried out using biotinylated antigen and subse- 
quent capture on streptavidin- coated magnetic beads^'^\ The speci- 
ficity of selected clones was confirmed by phage enzyme-linked 
immunosorbent assay (ELISA) against several unrelated antigens. 
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Table 2. Selection parameters of clones from n-CoDeR library^ 



Antigen 



(3-Gaiactosidase 
Cholera toxin (p-subunit) 
FITC 

Human cell surface 
antigen 

Human leptin 

Human prostate- 
specific antigen 

Lewis X 

Lewis Y 

MUC1 

Phenyloxazolone 
Streptavidin 

^N/D, not determined. 



Table 3. Dissociation constant {Kd o1 some monomeric scFv 
antibodies selected from the n-CoDeR library' 



Fiaure 2. Top view of the scFv antibody fragments specific for 
fluorescein isothiocyanate (FITC; clone FITC8). MUC1 peptide (Muc159), 
and cholera toxin (p-subunit) (clone CT17) showing the molecular 
surface of the individual combining sites. The homology models were 
generated using the program MODELER^s, gnd templates were from ttie 
Brookhaven Protein Databank (see Experimental Protocol) (A) The 
surface potential of the scFv fragments was mappe<3. onto the models. 
Positive potential (>15 mV) is colored blue, neutral potential (0 mV) is 
colored gray, and negative potential (5 -15 mV) is colored red. Arrows 
mark potential antigen-binding sites. (B) CDR residues, identified as 
frequent contact residues from x-ray crystallographic analysis of a large 
set of antigen-antibody complexes^^, were mapped onto the models. 
The CDRs are color-coded as follows: L1, dark green; L2, blue; L3, 
magenta; H1, red; H2, light green; H3, cyan. All the figures were 
generated with GRASP*. 

Most of the antigens gave high frequencies of binders after three 
rounds of selection. Binders against carbohydrate antigens could 
also be selected, albeit at a somewhat lower frequency. Affinities for a 
set of selected antibody fragments were then determined using the 
monomeric form of the scFv antibody fragment (Table 3). 

A central question in the evaluation of n-CoDeR was whether anti- 
body fragments encoded by the novel genes would be functional with 
respect to antigen binding. Our library design was based on the princi- 
ple that a specific single framework had the potential to include CDRs 
derived from other germline gene sequences. However, structural 
incompatibility between these foreign CDRs and the fbced -framework 
could potentially prevent the formation of functional antibody-com- 
bining sites. This was not found to be the case. A number of different 
antibody specificities could be generated, demonstrating ftmctional 
strucuiral diversity in the antigen-combining sitesrAn analysis using 
homology studies (see Experimental Protocol) revealed that the three- 
dimensional structure of three specific n-CoDeR antibodies exhibited 
an overall immunogiobulin-like fold. This fold was verified to be struc- 
turally most similar to antibody molecules according to the Dali 
program"; the best matches giving a Z-score of -16.0 and an r.m.s. 
deviation for Ca rigid-body super imposition of -0.6 A. Analysis of the 
solvent- accessible surface showed that the surface potential and in 
particular the topology of the combining sites differed among die three 
scFv fragments (Fig. 2A). The relationship between the topography of 
the combining site and the type of antigen recognized has emerged 
from previous x-ray crystallographic studies of antigen-antibody 
complexes^^ At least three classes of sites binding to haptens (cavity), 
peptides (groove), and proteins (planar) have been described. When 
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No. of 


Positive 


No. of 


selections 


clones/ 


unique 




tested 


1 ICO/ 




clones 


sequenced 






clones 


3 


49/59 


1/3 


3 


28/55 


3/5 


3 


85/87 


5/5 


4 


10/120 


5/10 


3 


B/60 


5/5 


4 


6/120 


4/4 


3 


4/53 


3/3 


3 


2/120 


2/2 


3 


51/169 


6/6 


3 


55/59 


N/D 


2 


7/14 


4/4 



Clone scFv 



K<,(x10-8 M-n 



FITC8 

FITC19 

F1TC18 

FITC9 

CT17 

CT15 

MUC41 

MUC23 

MUC46 

MUC20 

MUC44 



0,9 
1.9 
2.0 
3.2 
3.1 
7.6 
67 

123 

200 

270 

420 



>CT, cholera toxin (^-subunit); FITC, fluorescein isothiocyanate; MUG, mucin-1 . 

analyzing the accessible surface of our three models in more detail, we 
found a deep cavity within the center of the anticipated combining site 
of the anti-hapten antibody, fluorescein isothiocyanate (FITC) and an 
open groove on the anti-peptide antibody (MUCl peptide), whereas 
no obvious site could be identified on the anti-protein antibody 
against cholera toxin (p-subunit) (Fig. 2A). Several positively charged 
(Fig. 2A) and aromatic residues (data not shown), known from x-ray 
crystallographic studies of an anti-FFTC antibody to be of importance 
for binding of antigen^^ are present within ti\e deep cavity detected on 
the anti-FITC antibody. To further define the potential antigen- 
binding site, we mapped putative contact residues within the CDRs 
onto die modeled structure of tiiese tiiree scFv fragments (Fig. 2B). A 
potential combining site is clearly highlighted witiiin the deep cavity 
observed on die anti-FITC scFv, and the cavity is ahnost completely 
formed with these residues. In the case of the anti-MUCl scFv, the 
CDR residues duster more along an open groove and thus may consti- 
tute a direct contact site for die peptide. In contrast, a considerably 
larger and apparently flatter, more undulating surface area is highlight- 
ed on die anti-cholera toxin scFv, which is indicative of die much larger 
part of die paratope generally used for bmding to a protein antigen 
instead of a hapten (600-900 vs. 100-300 A^). 

The view of high frinctionality of n-CoDeR is supported by the 
fact that subnanomolar affinities can be directly selected from our 
library, a performance previously only reported from larger 
Ubraries^'-'^. The fact that carbohydrate clones specific for Lewis Y 
and Lewis X, previously found only in an immunized antibody gene 
Hbrary^^ can be direcdy selected from the library, points to large 
functional genetic variation. Finally, the selected VH framework is 
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Table 4A. Germline gene origin of CDRs found in unselected clones of the n-CoDeR library 
CDRH1 CDRH2 CDRH3 QDRL1 



Clone 



CDRL2 



CDRL3 



Origin 



No. of 



Origin 



No. of No. ofaa Origin No. of Origin No. of 



Origin 



No. of 



nt mut 




nt mut 






ntmut 




nt mut 




nt mut 


0 


3-30, 30-30-5 


0 


16 


1-44 


3 


1-40, 1-50 


0 


1-36. 1-44 


0 


0 


3-11, 3-48 


8 


14 


1-44 


1 


1-44, 1-47 


4 


1-36, 1-44 


0 


0 


3-7 


1 


8 


1-44 


0 


1-51 


5 


1-40 


0 


0 


3-23 


0 


12 


1-40 


0 


1-44 


2 


1-47 


3 


2 


3-23 


5 


10 


1-36 


6 


1-44 


3 


1-36, 1-44 


0 


1 


3-7 


8 


10 


1-47 


5 


1-44 


2 


1-47 


0 


4 


3-11, 3-23, 3-48 


14 


20 


1-40 


5 


1-47 


1 


1-47 


3 


2 


' 3-23 


9 


13 


1-44 


2 


1-40, 1-50 


3 


1-40 


0 


1 


3-23 


10 


9 


1-47 


6 


1-44 


1 


1-47 


0 


1 


3-21 


0 


7 


1-44 


5 


1-44 


0 


2-11 


2 



03 3-7 
09 3-9, 3-43 

011 3-30,3-30-3,3-64 

012 3-30,3-30-5,3-33 
O20 3-30, 3-30-3, 3-47, 3-64 

022 3-48 

023 3-11,3-13 
025 3-74 

029 3-30, 3-30-5, 3-33 
032 3-23 

»Pseudoaenes with sequence similarity identical to functional or ORF variants have not been included if such genes exist. Orphan VH loci found on chj-o^Qsomes 
15 and ifwere^^^^ considered in this comparison. The length of the amplified sequences are as follows: CDRH1 , 1 9 nt; CDRH2. 78 nt; CDRL1 , 41 nt; CDRL2. 21 nt. 
nt, Nucleotides; mut, mutations; aa, amino acids. 



Table 4B. Germline gene origin of CDRs found in selected clones of the n-CoDeP library* 

Clone Specificity CDRH1 CDRH2 CDRH3 CDRL1 CDRL2 CDRL3_ 



Origin No. of Origin No. of No. Origin No. of Origin No. of Origin No, of 

ntmut ntmut ofaa ntmut ntmut "^"^ 



PSA22 

GT9 

CT17 

FITC8 

FITC19 

Lepi 

Lep4 

MUC20 

MUC46 

Strep6 



Prostate-specific antigen 

Cholera toxin p-subunit 

Cholera toxin p-subunit 

Fluorescein 

Fluorescein 

Leptin 

Leptin 

MUG1 tandem repeat 
MUC1 tandem repeat 
Streptavidin 



3-48 


2 


3-30, 3-30-5 


4 


12 


1-44 


6 


1-47 


0 


1-47 


4 


3-23 


4 


3-7 


11 


14 


1-44 


2 


1-40, 1-50 


3 


1-36, 1-44 


2 


3-60, 3-62^ 


0 


3-23 


1 


20 


1-36, 1-44 


6 


1-40, 1-50 


0 


1-36. 1-44 


0 


3-7 


1 


3-23 


8 


8 


1-40 


0 


1-40, 1-50 


1 


1-47 


0 


3-11 


0 


3-23 


8 


8 


1-36 


2 


1-47 


0 


1-47 


1 


3-49 


1 


3-7 


1 


11 


1-44 


3 


1-36 


0 


1-36, 1-44 


6 


3-20 


0 


3-23 


1 


14 


1-36.1-44 


11 


1-40, 1-50 


0 


1-47 


0 


3-11 


1 


3-30, 3-30-5 


5 


9 


1-36, 1-44 


6 


1-40, 1-50 


1 


1-36, 1-44 


0 


3-7 


0 


3-30-3 


5 


9 


1-44 


6 


1-51 


1 


1-47 


4 


3-11 


1 


3-23 


2 


13 


1-44 


0 


1-41, 1-51 


0 


2-18 


1 



with sequence similarity identical to functional or ORF variants have not been included if such genes exist. Orphan VH loci found on chromosomes 15 
>t considered in this companson. The lengths of the amplified sequences are as follows: CDRH1 , 1 9 nt; CDRH2, 78 nt; CDRL1 , 41 nt; CDRL2. 21 nt. 



'Pseudogenes \ 

and 16 were not considered in this companson. 
nt, Nucleotides; mut, mutations; aa, amino acids. 
^Both of these germline genes are pseudogenes. 



DP47, which belongs to the VH3 family, the menabers of which have 
been shown to have limited polyreactivity compared with members 
of other VH families^l This fact might also contribute to the func- 
tionality and specificity of the selected antibody fragments. 

CDR-usage in the antibody fragments. We sequenced a total of 
53 clones from the Ubrary that had not been selected for antigen 
binding, and found all of them to be mosaic; i.e. each clone con- 
tained combinations of CDR sequences of different germline origin 
(Table 4A). Some of the CDR sequences in the Ubrary were found to 
originate from unmutated germline sequences, whereas some 
sequences contained mutations. The frequencies and locations of 
these mutations exclude the PGR process as the predominant source 
of nucleotide substitutions (data not shown). Thus, the functional 
genetic variation in n-CoDeR arises from both the combination of 
CDRs originating from different germline sequences and by the 
presence of mutations in individual CDR sequences. These muta- 
tions are a consequence of the somatic mutation process. 
Furthermore, after sequencing the clones selected for antigen 
specificity, the antibody fragment genes were all of the expected size, 
without stop codons or deletions, and were also found to be mosaic 
(Table 4B). In no clone was the full-length original DP47 and DPL3 
germline sequence combination found. From the analyses of the 
selected clones, it is clear that the immunoglobulin scaffold has a 
thus far uncharacterized ability to harness foreign CDR loops. This 
incorporation of foreign CDRs into a single framework has similari- 
ties to the gene conversion event in some species, which allows for 
translocation of defined gene segments into a given gene sequence". 



No such in vivo mechanism has been described in the human 
immune system, and the flexibility of the immunglobulin scaffold to 
accept foreign gene sequences presented here demonstrates that 
functional human antibodies may be formed utilizing principles 
other than what evolution provides. 

Experimental protocol 

Isolation of in vivo proofread CDRs from various B-cell sources. To achieve 
largest possible variation in the Ubrary, B cells. originating from different 
individuals and from different lymphoid tissue/organs, were used as a source 
of CDRs. RMA was prepared from lymph nodes from seven different donors, 
peripheral blood mononuclear cells (PBMCs) from nine different donors, 
spleen from one donor, and tonsil RNA from five donors. The total number 
of B cells used was approximately 2 x 10^ Total RNA was prepared using an 
RNA isolation kit from Promega (Madison, WI), and cDNA synthesis was 
performed with Superscript II (Life Technologies Inc., Paisley, UK) according 
to the method of Engberg and coworkers^^; using 50 ^ig RNA per reaction. 
The cDNA synthesis reaction was primed with 3' primers specific for the 
human y, p., a, and X constant region. A total of 30 reactions (15 VH and 15 
VL) were performed, and to avoid bias all samples were treated separately in 
the following PGR amplification steps. Primers used in the PGR amplifica- 
tion of the heavy- chain CDRs were essentially as described^ ^ and the primers 
used for the amplification of light-chain CDRs were designed according to 
the same principle. For each CDR a pair of primers was used, homologous to 
the master framework. These primers allowed for the ampUfication of CDRs 
from a variety of germline sequences. The CDRs are defined as described in 
VBASE (http;//www.mrc-cpe.cam.ac.uk/imt-doc/pubhc/INTRO.htnil). Each 
of the sue CDRs was PCR amplified using TaqPlus Precision PCR System 
(Stratagene, La JoUa, CA), according to the manufacturer's instructions. 
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Library construction. PGR amplification of framework regions was per- 
formed usmg the same principle as for the CDRs, and a clone contaming the 
DP4^ and DPU frameworks'^ was used as template. Purification of the PCK 
nroduct from both CDR and framework ampUfications were pertormed 
essentiaUy as described^'. The smgle-stranded CDR and framework frag- 
ments were assembled into scFv genes in a two-step overlap e):tension PGR. 
usme TaqPlus Precision PGR System, as described^^. The assembled scFv 
antibody gene fragments were purified using QIAquick PGR Pu-^cation Kit 
(Qiagen HUden, Germany), digested with SfH and Nod (New England 
BTolabs. Beverly, MA), and punfied using QIAquick Gel Extraction Kit 
(Oiagen) They were then Ugated into the Nod/Sfil sites in the pR\B5c-His 
vector and transformed into electrocompetent TOPIOF' E. coh ceUs. 
AmpHfication of the library and phage stock production with R408 helper- 
phaee was done as described^^. , . . , , 

Selection of specific antibody fragments. Selection on biotinylated antigen 
was done using streptavidin- coated Dynabeads (Dynal AS.. Oslo. Norwy) as 
describedW3, except that bound phage were eluted with tr>-psin. as described . 
The amount of phage used was 1.5 x 10'^ c.f.u. in the first selection and approxi- 
mately 1 X 10^0 in later selectibn steps. Antigen concentrations ranged trom 
1 X 10-^ M to 1 X 10-'2 M. For phagemid amplification, log phase E. coh TOPIOF 
cells were infeaed with the eluted phage and grown in Luria-Bertam (LB) 
medium contaimng ampicilUn (100 ^g ml"^) and tetracycline (15 Hg ml" ) at 
37°C for ~3 h before infection with 20 times excess of helper phage R40S. 
Isopropyl-6-D-thiogalactoside (IPTG) was added to 0.1 mM, and growth was 
continued at 25^G for 13 h before the amplified phage were harvested, AAer 
selections, phage displaying antigen-bindmg antibody fragments were identified 
by EUSA using an anti-M13 peroxidase -conjugated antibody (Pharmaaa 
Biotech. Uppsala, Sweden), as described". 

Analyses of CDR origin. DNA-sequencing reactions were performed using 
BieDye Terminator Sequencing kit (AppUed Biosystems, Warrington. UK), 
according to the manufacturer's protocol, and were gel analyzed by GyberGene 
AB (Stockhobn. Sweden). Analysis of the GDR origin was^erformed using the 
FastA algorithm run on the Wisconsin Package (version 8: Genetics Computer 
Group Madison, W) . Comparisons of sequences amplified by the pruners used 
in these experiments were made with sequences found in the VB.ASE sequence 
dureaory (http://www.mrc-cpe.cam-ac.uk/imt-doc/public/INTRO.html). Each 
CDR was assigned a locus origin as defined by the description of the human 
IgVHand\aioci"-3*i. 

Affinity determination. The monomenc form of the antibody fragments was 
purified by size-exclusion gel chromatography on a Superose G75 FPLG 16/60 
column ( Amersham Pharmacia Biotech AB, Sweden). The affinity was deter- 
mined as described'^ 

Structure modeling- A homology model was generated for the sctv 
antibody fragments specific for FITG, MUCl peptide, and cholera toxiri 
13-subunit) (GT) using the program MODELER''. Templates were selected 
from the Brookhaven Protein Databank: llgM (FITG. GT) and lAQK 
(MUCl) for the VH domain and 2FB4 (FITG, MUCl. GT) for the VL 
domain. Tags were removed before model generation. The models were 
built by Molecular Simulations Inc. (San Diego, CA). 

Analysis of T-cell epitopes. Peptide threading'^ was used to analyze the 
presence of potential T-cell epitopes in VH and VL domains of 10 selected 
scFv fragments. The analyses were performed at Biovation Ltd. (Aberdeen. 
Scotland). For comparison, the variable domains from 10 natural human 
antibody sequences were analyzed- Peptide threading analyzes the potential 
binding of every possible 13-mer from the proteins of interest to human 
HLA-DR allotypes. The foUowing HLA-DR allotypes, which represent 
>96% of the population, were mcluded in the analysis; 1, 15(2). 16(2), 17(3), 
18(3),4. 11(5). 12(5), 13(6). 14(6), 7. 8, 9, 10, 52. 53. 103. 
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